D-MAP: a distance-normalized MAP estimation of speaker models for automatic speaker verification
نویسندگان
چکیده
In this paper we introduce a MAP estimation of speaker models in Automatic Speaker Verification with a distance constraint: the D-MAP adaptation. The D-MAP is based on the Kullback-Leibler distances and provides an easy way to automatically compute a speaker-dependent adaptation of the model parameters. We formulate a distance constrained MAP criterion and we show an equivalence between the D-MAP adaptation and the score normalization called D-Norm. From the results obtained with the D-MAP technique, we show that this method gives better performance than a classical speakerindependent MAP adaptation. It is also found that the D-MAP based system without score normalization performs similarly to a classical MAP system with a modelbased score normalization.
منابع مشابه
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملDiscriminative adaptation for speaker verification
Speaker verification is a binary classification task to determine whether a claimed speaker uttered a phrase. Current approaches to speaker verification tasks typically involve adapting a general speaker Universal Background Model (UBM), normally a Gaussian Mixture Model (GMM), to model a particular speaker. Verification is then performed by comparing the likelihoods from the speaker model to t...
متن کاملQuantitative influence of speech variability factors for automatic speaker verification in forensic tasks
Regarding speaker identity in forensic conditions, several factors of variability must be taken into account, as peculiar intra-speaker variability, forced intra-speaker variability or channel-dependent external influences. Using ‘AHUMADA’ large speech database in Spanish, containing several recording sessions and channels, and including different tasks for 100 male speakers, automatic speaker ...
متن کاملExploiting GMM-based Quality Measure for SVM Speaker Verification
In this paper, we examine the problem of quality measurement for speaker verification using support vector machines (SVMs). An efficient Gaussian mixture models (GMMs) based quality estimation algorithm is proposed to potentially utilize speaker-specific broad acoustic-class characteristics. Some verification strategies are also considered in the test phase. We perform clustering-based vector p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003